Distinct Mutational Behaviors Differentiate Short Tandem Repeats from Microsatellites in the Human Genome
نویسندگان
چکیده
A tandem repeat's (TR) propensity to mutate increases with repeat number, and can become very pronounced beyond a critical boundary, transforming it into a microsatellite (MS). However, a clear understanding of the mutational behavior of different TR classes and motifs and related mechanisms is lacking, as is a consensus on the existence of a boundary separating short TRs (STRs) from MSs. This hinders our understanding of MSs' mutational properties and their effective use as genetic markers. Using indel calls for 179 individuals from 1000 Genomes Pilot-1 Project, we determined polymorphism incidence for four major TR classes, and formalized its varying relationship with repeat number using segmented regression. We observed a biphasic regime with a transition from a faster to a slower exponential growth at 9, 5, 4, and 4 repeats for mono-, di-, tri-, and tetranucleotide TRs, respectively. We used an in vitro mutagenesis assay to evaluate the contribution of strand slippage errors to mutability. STRs and MSs differ in their absolute polymorphism levels, but more importantly in their rates of mutability growth. Although strand slippage is a major factor driving mononucleotide polymorphism incidence, dinucleotide polymorphism incidence is greater than that expected due to strand slippage alone, indicating that additional cellular factors might be driving dinucleotide mutability in the human genome. Leveraging on hundreds of human genomes, we present the first comprehensive, genome-wide analysis of TR mutational behavior, encompassing several motif sizes and compositions.
منابع مشابه
Mature Microsatellites: Mechanisms Underlying Dinucleotide Microsatellite Mutational Biases in Human Cells
Dinucleotide microsatellites are dynamic DNA sequences that affect genome stability. Here, we focused on mature microsatellites, defined as pure repeats of lengths above the threshold and unlikely to mutate below it in a single mutational event. We investigated the prevalence and mutational behavior of these sequences by using human genome sequence data, human cells in culture, and purified DNA...
متن کاملWhat Is a Microsatellite: A Computational and Experimental Definition Based upon Repeat Mutational Behavior at A/T and GT/AC Repeats
Microsatellites are abundant in eukaryotic genomes and have high rates of strand slippage-induced repeat number alterations. They are popular genetic markers, and their mutations are associated with numerous neurological diseases. However, the minimal number of repeats required to constitute a microsatellite has been debated, and a definition of a microsatellite that considers its mutational be...
متن کاملDomain-level differences in microsatellite distribution and content result from different relative rates of insertion and deletion mutations.
Microsatellites (short tandem polynucleotide repeats) are found throughout eukaryotic genomes at frequencies many orders of magnitude higher than the frequencies predicted to occur by chance. Most of these microsatellites appear to have evolved in a generally neutral manner. In contrast, microsatellites are generally absent from bacterial genomes except in locations where they provide adaptive ...
متن کاملA threshold size for microsatellite expansion.
When is a DNA repeat sequence a microsatellite? Microsatellites are tandem arrays of short (1–5 bp) repeats, characterized by rapid expansion and contraction through a process of ‘‘dynamic mutation’’ (Sutherland and Richards 1995). Despite their importance in modern genetic analyses (Bruford and Wayne 1993; Dib et al. 1996) and association with several human genetic diseases (Mandel 1994), litt...
متن کاملDetecting short tandem repeats from genome data: opening the software black box
Short tandem repeats, specifically microsatellites, are widely used genetic markers, associated with human genetic diseases, and play an important role in various regulatory mechanisms and evolution. Despite their importance, much is yet unknown about their mutational dynamics. The increasing availability of genome data has led to several in silico studies of microsatellite evolution which have...
متن کامل